Contributions to Kernel Equating

نویسنده

  • BJÖRN ANDERSSON
چکیده

Andersson, B. 2014. Contributions to Kernel Equating. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences 106. 24 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-554-9089-8. The statistical practice of equating is needed when scores on different versions of the same standardized test are to be compared. This thesis constitutes four contributions to the observedscore equating framework kernel equating. Paper I introduces the open source R package kequate which enables the equating of observed scores using the kernel method of test equating in all common equating designs. The package is designed for ease of use and integrates well with other packages. The equating methods nonequivalent groups with covariates and item response theory observed-score kernel equating are currently not available in any other software package. In paper II an alternative bandwidth selection method for the kernel method of test equating is proposed. The new method is designed for usage with non-smooth data such as when using the observed data directly, without pre-smoothing. In previously used bandwidth selection methods, the variability from the bandwidth selection was disregarded when calculating the asymptotic standard errors. Here, the bandwidth selection is accounted for and updated asymptotic standard error derivations are provided. Item response theory observed-score kernel equating for the non-equivalent groups with anchor test design is introduced in paper III. Multivariate observed-score kernel equating functions are defined and their asymptotic covariance matrices are derived. An empirical example in the form of a standardized achievement test is used and the item response theory methods are compared to previously used log-linear methods. In paper IV, Wald tests for equating differences in item response theory observed-score kernel equating are conducted using the results from paper III. Simulations are performed to evaluate the empirical significance level and power under different settings, showing that the Wald test is more powerful than the Hommel multiple hypothesis testing method. Data from a psychometric licensure test and a standardized achievement test are used to exemplify the hypothesis testing procedure. The results show that using the Wald test can provide different conclusions to using the Hommel procedure.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

IRT Observed-Score Kernel Equating with the R Package kequate

The R package kequate enables observed-score equating using the kernel method of test equating. We present the recent developments of kequate, which provide additional support for item-response theory observed score equating using 2-PL and 3-PL models in the equivalent groups design and non-equivalent groups with anchor test design using chain equating. The implementation also allows for local ...

متن کامل

An Alternative Continuization Method to the Kernel Method in von Davier, Holland and Thayer’s (2004) Test Equating Framework

von Davier, Holland and Thayer (2004) laid out a five-step framework of test equating which can be applied to various data collection designs and equating methods. In the continuization step, they present an adjusted Gaussian kernel method which preserves the first two moments. This paper proposes an alternative continuization method which directly uses the log-linear function from the smoothin...

متن کامل

An Evaluation of Kernel Equating: Parallel Equating With Classical Methods in the SAT Subject TestsTM Program

ETS, the ETS logo, and LISTENING. LEARNING. LEADING. are registered trademarks of Educational Testing Service (ETS). PRAXIS is a trademark of ETS. SAT SUBJECT TESTS and SAT REASONS TESTS are trademarks of the College Board. PSAT/NMSQT is a registered trademark of the College Board and the National Merit Scholarship Corporation As part of its nonprofit mission, ETS conducts and disseminates the ...

متن کامل

Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent groups design

The purpose of this study was to evaluate the effectiveness of the hybrid Levine equipercentile (Hybrid LE) and modified frequency estimation (MFE) equating methods in improving accuracy of equating as compared to the percentile rank frequency estimation (FE), kernel frequency estimation (Kernel FE) and percentile rank chained equipercentile (CE) equating methods under the common-item nonequiva...

متن کامل

A comparison of Van der Linden's conditional equipercentile equating method with other equating methods under the random groups design

To ensure test security and fairness, alternative forms of the same test are administered in practice. However, alternative forms of the same test generally do not have the same test difficulty level, even though alternative test forms are designed to be as parallel as possible. Equating adjusts for differences in difficulties among forms of the test. Six traditional equating methods are consid...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014